Monophonic sound source separation with an unsupervised network of spiking neurones

نویسندگان

  • Ramin Pichevar
  • Jean Rouat
چکیده

We incorporate auditory-based features into an unconventional pattern classification system, consisting of a network of spiking neurones with dynamical and multiplicative synapses. Although the network does not need any training and is autonomous, the analysis is dynamic and capable of extracting multiple features and maps. The neural network allows computing a binary mask that acts as a dynamic switch on a speech vocoder made of an FIR gammatone analysis/synthesis bank of 256 filters. We report experiments on separation of speech from various intruding sounds (siren, telephone bell, speech, etc.) and compare our approach to other techniques by using the Log Spectral Distortion (LSD) metric.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cochleotopic/AMtopic (CAM) and Cochleotopic/Spectrotopic (CSM) map based sound sourcce separation using relaxatio oscillatory neurons

We use a two-layered unsupervised bio-inspired neural network to segregate sound sources, e.g. double-vowels or vowels intruded by nonstationary noise sources. The network consists of spiking neurons. The spiking neurons in both layers are modelized by relaxation oscillators. The first layer of the network is locally connected, while the second layer is a fully connected network. We show that i...

متن کامل

Towards Neurocomputational Speech and Sound Processing

From physiology we learn that the auditory system extracts simultaneous features from the underlying signal, giving birth to simultaneous representations of audible signals. We also learn that pattern analysis and recognition are not separated processes (in opposition to the engineering approach of pattern recognition where analysis and recognition are usually separated processes). Furthermore,...

متن کامل

Source Separation with One Ear: Proposition for an Anthropomorphic Approach

Wepresent an example of an anthropomorphic approach, in which auditory-based cues are combined with temporal correlation to implement a source separation system. The auditory features are based on spectral amplitude modulation and energy information obtained through 256 cochlear filters. Segmentation and binding of auditory objects are performed with a two-layered spiking neural network. The fi...

متن کامل

Real - Time Pitch Detection

Finally, there is a definition problem between monophonic and polyphonic sounds. In the case of monophonic sound, the obvious definition is to pick the lowest partial as the fundamental frequency. In the case of polyphonic sounds, resulting either from one source (e.g., a piano) or from many sources (e.g., orchestra, choir), the definition is far more difficult, and approaches close to the prob...

متن کامل

Remixing musical audio on the web using source separation

Research in audio source separation has progressed a long way, producing systems that are able to approximate the component signals of sound mixtures. In recent years, many efforts have focused on learning time-frequency masks that can be used to filter a monophonic signal in the frequency domain. Using current web audio technologies, time-frequency masking can be implemented in a web browser i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Neurocomputing

دوره 71  شماره 

صفحات  -

تاریخ انتشار 2007